Abstract

We study the iterated Galton-Watson process $(X_n)_n$, possibly with
thinning, introduced by Gawel and Kimmel to model the number of repeats
of DNA triplets during some genetic disorders. If the process involves
some thinning, then extinction $\{X_n \to 0\}$ and explosion
$\{X_n \to \infty\}$ can have positive probability simultaneously. If the
underlying (simple) Galton-Watson process is nondecreasing with mean $m$,
then, conditionally on explosion, $\log X_{n+1} \sim X_n \cdot \log m$
almost surely. This simplifies arguments of Gawel and Kimmel, and confirms
and extends a conjecture of Pakes.
Introduction

The parameters of an iterated Galton-Watson (IGW) process are a real number
$\vartheta$ in $(0, 1]$, called the thinning parameter, and a probability
measure $(p_k)_{k \ge 0}$ on the nonnegative integers, called the reproduction
law. The associated IGW process is a Markov chain $(X_n)_{n \ge 0}$ with
nonnegative integer values, whose stochastic evolution is as follows.

Let $(Z^{(n)})_{n \ge 0}$ denote an auxiliary i.i.d. collection of Galton-Watson
processes $Z^{(n)} = (Z^{(n)}_k)_{k \ge 0}$ with common reproduction distribution
$(p_k)$ and starting from $Z^{(n)}_0 := 1$. Let
$(\varepsilon^{(n)}_k)_{n \ge 0,\, k \ge 1}$ denote an auxiliary i.i.d. collection
of Bernoulli random variables with common distribution

$$P(\varepsilon^{(n)}_k = 1) := \vartheta =: 1 - P(\varepsilon^{(n)}_k = 0).$$

Assume that $X_0$, $(Z^{(n)})$ and $(\varepsilon^{(n)}_k)$ are independent. Let
$S^{(n)}_x$ denote the population up to level $x$ of the $n$th Galton-Watson
process, that is,

$$S^{(n)}_x := Z^{(n)}_1 + \cdots + Z^{(n)}_x.$$

If $X_n = 0$, then $X_{n+1} := 0$. If $X_n = x \ge 1$, then
$X_{n+1} := \varepsilon^{(n)}_1 + \cdots + \varepsilon^{(n)}_y$, where
$y := S^{(n)}_x$. In other words, the distribution of $X_{n+1}$ conditionally on
$\{X_n = x\}$ and $S^{(n)}_x$ is binomial with parameters $S^{(n)}_x$ and
$\vartheta$.
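The transition rule above can be simulated directly. The following is a minimal Python sketch, not taken from the paper: the function names, the encoding of the reproduction law as a list `p = [p_0, p_1, ...]` with finite support, and the use of the standard `random` module are all assumptions made for illustration.

```python
import random

def sample_offspring(p, rng):
    """Draw one offspring count from the reproduction law p = [p_0, p_1, ...]."""
    return rng.choices(range(len(p)), weights=p, k=1)[0]

def cumulative_population(x, p, rng):
    """Return S_x = Z_1 + ... + Z_x for one Galton-Watson process with Z_0 = 1."""
    z, total = 1, 0
    for _ in range(x):
        # Each of the z current individuals reproduces independently.
        z = sum(sample_offspring(p, rng) for _ in range(z))
        total += z
    return total

def igw_step(x, p, theta, rng):
    """One transition X_n = x -> X_{n+1}: binomial thinning of S_x with parameter theta."""
    if x == 0:
        return 0  # 0 is absorbing
    s = cumulative_population(x, p, rng)
    # Keep each of the s individuals independently with probability theta.
    return sum(1 for _ in range(s) if rng.random() < theta)
```

Since trajectories can grow extremely fast, this naive sketch is only practical for small values of `x` and a few steps.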






Gawel and Kimmel (1996) introduced IGW processes to model the explosive
growth of the number of repeats of DNA triplets in specific regions of the genome
during heritable disorders such as fragile X-syndrome. Here, $1 + X_n$ models the
length of a linear chain of DNA repeats after $n$ replications, and what we know
of the molecular mechanism of replication suggests describing the evolution of
$(X_n)$ as above. (Gawel and Kimmel use a different convention and define the
IGW process as $(X'_n)_{n \ge 0}$ with $X'_n := 1 + X_n$.) Indeed, simulations in
Gawel and Kimmel when $p_0 = 0$ and $\vartheta \ne 1$ suggest that IGW processes
either die out or grow extremely fast after a period of relative quiescence. This
is precisely the behaviour of the number of repeats of DNA triplets during some of
these genetic disorders. See also the book by Kimmel and Axelrod (2002), which
repeats the analysis of Gawel and Kimmel.

We define the explosion F and death D of the IGW process as the events 

$$F := \{X_n \to \infty\}, \qquad D := \{X_n \to 0\} = \{X_n = 0 \text{ for } n \text{ large enough}\}.$$

Thus F and D are mutually exclusive. (In the context of genetic disorders, 
the death D of the process corresponds to the extinction of the diseased gene 
lineage, and the explosion F to the death of the patient.) Gawel and Kimmel 
show that $P(D) = 1$ as soon as $p_0 \ne 0$, and that $P(F) = 1$ when $p_0 = 0$,
$p_1 \ne 1$ and $\vartheta = 1$. Later on, their arguments were simplified by
Pakes (2003). Pakes also conjectured that, when $p_0 = 0$, $p_1 \ne 1$ and
$\vartheta = 1$, $(\log X_{n+1})/X_n$ converges almost surely to $\log m$, where
$m > 1$ denotes the mean of $(p_k)$, that is

$$m := \sum_{k \ge 0} k\, p_k.$$

In this paper, we determine the asymptotic behaviour of every IGW process. We 
confirm the conjecture of Pakes and extend it to IGW processes with thinning, 
that is, to the case $\vartheta \ne 1$, and we refine partial results of Gawel and Kimmel
which are not recalled above. 

Propositions 1 and 2 below state some simple facts about the mean behaviour
of IGW processes and about their almost sure behaviour in some degenerate
cases. Some of these are due to Gawel and Kimmel, or to Pakes. We write $P_x$
and $E_x$ for the probability $P$ and the expectation $E$, conditional on $X_0 = x$.

Proposition 1 The mean behaviour of the IGW process is as follows.

(1) If $m > 1$, then $E_x(X_n) \to \infty$ for every $x \ge 1$.

(2) If $m < 1$, or if $m = 1$ and $\vartheta \ne 1$, then $E_x(X_n) \to 0$ for every $x \ge 0$.

(3) If $m = \vartheta = 1$, then $E_x(X_n) = x$ for every $x \ge 0$.

Proposition 2 The almost sure behaviour of the IGW process in some degenerate
cases is as follows.

(4) If $p_0 \ne 0$, then $P_x(D) = 1$ for every $x \ge 0$.

(5) If $p_0 = 0$, $p_1 \ne 1$, and $\vartheta = 1$, then $P_x(F) = 1$ for every $x \ge 1$.

The hypotheses of statements (1) and (4) are compatible, hence one can have
$E_x(X_n) \to \infty$ and $P_x(D) = 1$ simultaneously, for every $x \ge 1$. This
contrasts with the behaviour of usual Galton-Watson processes. We turn to the
nondegenerate case that proposition 2 leaves out.

Theorem A Assume that $p_0 = 0$ and $p_1 \ne 1$.

(6) For every $x \ge 0$, $P_x(D) + P_x(F) = 1$.

(7) Assume furthermore that $\vartheta \ne 1$. For every $x \ge 1$, $P_x(D)$ and
$P_x(F)$ are both positive, and $P_x(D) \le P_1(D)^x$. Thus, $P_x(F) \to 1$ when
$x \to \infty$.

We stress that (7) does not state that $P_x(D)$ decreases geometrically when
$x \to \infty$. In fact, one can show in this case that

log 1/P2.(_D) when X ^ oo. 

We now state our main result. 

Theorem B Assume that $p_0 = 0$. Conditionally on the explosion $F$, the random
variable $(\log X_{n+1})/X_n$ converges almost surely to $\log m$. In particular,
conditionally on $F$, $X_{n+1}/X_n$ converges almost surely to infinity.

As regards the process conditional on the death D, one could try to show that, 
if suitably renormalized, the death time, that is, the first hitting time of 0, 
converges in distribution, conditionally on D and when the starting point of the 
process goes to infinity. We do not pursue this in the present paper. 

In sections 1 and 2 we (re)prove propositions 1 and 2. In section 3 we deal
with the easy parts of theorem A, that is, (6) and the facts that the probability
of death is not zero and that it is at most geometric. In section 4 we provide
explicit upper bounds of the probability of death in some specific cases. Section 5
exposes a strategy of proof for the study of the explosion case. In section 6 we
apply this strategy, first to the proof of the remaining assertion of (7), that is,
the fact that the probability of explosion is not zero, and finally to the proof of
theorem B.

1 Proof of proposition 1

From the construction of the IGW process, one sees that, for every $x \ge 0$,
$E_x(X_1) = \chi(x)$, where the sequence $\chi$ is defined by $\chi(0) := 0$ and,
for every $x \ge 1$,

$$\chi(x) := \vartheta\, (m + \cdots + m^x).$$

When $m > 1$, $\chi$ can be extended to a convex function on $[0, +\infty)$, which
we still call $\chi$. (This step of the proof would be false for $m < 1$.) Thus,
for every $x \ge 0$ and every $n \ge 0$,

$$E_x(X_{n+1}) \ge \chi(E_x(X_n)).$$

Choose $x_0 \ge 1$ large enough, such that $\chi(x_0) \ge 2 x_0$. Then,
$E_{x_0}(X_n) \ge x_0 2^n$ for every $n \ge 0$. (This step of the proof uses the
fact that $\chi$ is nondecreasing.) For smaller values of $x$, for instance for
$x = 1$, consider the event that $X_n \ge x_0$. For $n$ large enough, $n = n_0$
say, this event has positive probability $v_0$ with respect to $P_1$. Finally, (1)
holds since, for every $n \ge n_0$ and every $x \ge 1$,

$$E_x(X_n) \ge E_1(X_n) \ge P_1(X_{n_0} \ge x_0)\, E_{x_0}(X_{n-n_0}) \ge v_0\, x_0\, 2^{n-n_0}.$$

When $m = 1$, $E_x(X_1) = \vartheta x$, thus the $m = 1$ cases in (2) and (3) are
obvious. Finally, when $m < 1$, $E_x(X_1) \le \vartheta m x$ and $\vartheta m < 1$,
thus $E_x(X_n) \to 0$. This completes the proof of (2).
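As a numerical companion to this proof, the sketch below evaluates the mean function $\chi$ and searches for a threshold $x_0$ with $\chi(x_0) \ge 2 x_0$. It is only an illustration: the function names and the search cap `x_max` are hypothetical and do not come from the paper.

```python
def chi(x, m, theta):
    """chi(x) = theta * (m + m**2 + ... + m**x), with chi(0) = 0."""
    return theta * sum(m ** k for k in range(1, x + 1))

def first_doubling_point(m, theta, x_max=1000):
    """Smallest integer x0 in [1, x_max] with chi(x0) >= 2 * x0, or None if there is none."""
    for x0 in range(1, x_max + 1):
        if chi(x0, m, theta) >= 2 * x0:
            return x0
    return None
```

For instance, with `m = 2` and `theta = 0.5` one gets `chi(3, 2, 0.5) = 7.0`, which is at least `6`, whereas `chi(2, 2, 0.5) = 3.0` is below `4`. When `m = 1` no such threshold exists, in line with statements (2) and (3).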

2 Proof of proposition 2

The proof of (5) is direct since in that case, $X_{n+1} \ge X_n$ almost surely.

As regards (4), if $p_0 \ne 0$, for each $n \ge 0$, the event that $Z^{(n)}_1 = 0$
has positive probability $p_0$ and these events are independent. Almost surely,
one of them is realized, say $Z^{(n)}_1 = 0$. Then $S^{(n)}_x = 0$ for every
$x \ge 1$ and $X_{n+1} = 0$, which proves (4). The argument shows also that

$$P_x(X_n \ne 0) \le (1 - p_0)^n,$$

for every $n$ and $x$. In other words, the time to absorption is at most geometric,
uniformly.

3 Elementary parts of theorem A

3.1 Almost surely, death or explosion 

This is straightforward, and analogous to the usual Galton-Watson case. One
has to check that $0$ is the only non transient state of the Markov chain $(X_n)$.
If $p_0 = 0$ and $\vartheta = 1$, this is true, see the proof of (5). If $p_0 = 0$
and $\vartheta \ne 1$, the return to $x \ge 1$, starting from $x$, assumes that
the first step is not to $0$. Thus, it has a probability at most
$1 - P_x(X_1 = 0) < 1$, see section 3.2.

3.2 Probability of death, not zero 

When $X_0 = x$, $X_1 = 0$ iff the thinning kills each and every one of the
$S^{(0)}_x$ individuals. Hence, for every $x \ge 0$,

$$P_x(D) \ge P_x(X_1 = 0) = E\big((1 - \vartheta)^{S_x}\big),$$

which is positive.

3.3 Probability of death, at most geometric 

Assume that $p_0 = 0$ and fix $x \ge 1$ and $y \ge 1$. The fundamental branching
property of the Galton-Watson process $Z^{(n)}$ means that $Z^{(n)}_{x+y}$ is the
sum of $Z^{(n)}_x \ge 1$ random variables distributed like $Z^{(n)}_y$. Hence
$S^{(n)}_{x+y}$ is stochastically greater than the sum of $S^{(n)}_x$ and of an
independent copy of $S^{(n)}_y$. After thinning, this shows that $X_1$ under
$P_{x+y}$ is stochastically greater than the sum of $X_1$ under $P_x$ and of an
independent copy of $X_1$ under $P_y$. By recursion over $n \ge 1$, the same
assertion holds when one replaces $X_1$ by $X_n$. Hence,

$$P_{x+y}(X_n = 0) \le P_x(X_n = 0)\, P_y(X_n = 0).$$

This implies that $P_x(D) \le P_1(D)^x$ for every $x \ge 0$, an assertion of (7).

Note that the important step here is to prove that $P_1(D) \ne 1$. We do this in
section 4 in some specific cases, the general case is in section 6.1.

4 Upper bounds of the probability of death 

When $p_0 = 0$, in the special case $m\vartheta > 1$, one can bound explicitly
$P_1(D)$ by some $q < 1$, using the generating function of $(p_k)$. Thanks to
section 3.3, this proves that $P_x(D) \le q^x$.

4.1 Special case 

One can often get an upper bound of $q := P_1(D)$ at small cost, as follows. Since
the sequence $(P_x(D))_{x \ge 0}$ is submultiplicative,

$$q = E_1(P_{X_1}(D)) \le E_1(q^{X_1}).$$

The generating function of $X_1$ is
$g(s) := E_1(s^{X_1}) = f(1 - \vartheta + \vartheta s)$, with

$$f(s) := \sum_{k \ge 0} p_k s^k.$$

Thus, $q \le g(q)$ and it is not hard to see that $q$ is at most the smallest root
of the equation $s = g(s)$. In the supercritical case $g'(1^-) = m\vartheta > 1$,
$s \le g(s)$ for $s = 1$ and for $s \le q^*$, where $q^*$ in $(0, 1)$ is defined by
$q^* = g(q^*)$. Finally, $q \le q^*$. In particular, $P_1(D) \ne 1$.

4.2 Binary case 

Assume that the Galton-Watson process describes a binary replication with
efficiency $\lambda$. Thus, $f(s) := (1 - \lambda)s + \lambda s^2$ and
$m = 1 + \lambda$. Elementary computations then yield the following. If
$\vartheta > 1/m$, $q \le q(\lambda, \vartheta)$ with

$$q(\lambda, \vartheta) := (1 - \vartheta)(1 - \lambda\vartheta)/(\lambda\vartheta^2).$$

Note that $q(\lambda, \vartheta)$ is in $(0, 1)$, except when $\vartheta = 1$, and
then $q = q(\lambda, 1) = 0$, and when $\vartheta = 1/m$, and then
$q(\lambda, \vartheta) = 1$ but $q < 1$.
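The closed form of the binary case can be checked numerically: $q(\lambda, \vartheta)$ should be a fixed point of $g(s) = f(1 - \vartheta + \vartheta s)$. The following Python sketch performs this check. The function names `g` and `q_bound` are chosen here for illustration and are not part of the paper.

```python
def g(s, lam, theta):
    """Generating function of X_1 under P_1 in the binary case: f(1 - theta + theta*s)."""
    u = 1.0 - theta + theta * s
    return (1.0 - lam) * u + lam * u * u

def q_bound(lam, theta):
    """Closed form q(lam, theta) = (1 - theta)(1 - lam*theta) / (lam * theta**2)."""
    return (1.0 - theta) * (1.0 - lam * theta) / (lam * theta ** 2)

# Example with efficiency lam = 0.5 (so m = 1.5) and thinning theta = 0.8 > 1/m.
q = q_bound(0.5, 0.8)
residual = abs(g(q, 0.5, 0.8) - q)  # should vanish up to rounding
```

With these values, `q` equals 0.375 up to rounding, and the boundary cases `theta = 1` and `theta = 1/m` give 0 and 1 respectively, as stated in the text.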

5 A general strategy 

Assume that $p_0 = 0$. We first explain our strategy for the study of the explosion
of the IGW process. We fix an integer valued sequence $(\varphi(x))_x$ with
$\varphi(x) \ge x$ for every $x$, and an integer valued sequence $(\psi(x))_x$.

Definition 3 Let $C$ denote the event that $X_{n+1} \ge \varphi(X_n)$ for every
$n \ge 0$. For every $x \ge 1$, let
$\eta_x := \varepsilon_1 + \cdots + \varepsilon_{\psi(x)}$ denote a random variable
of binomial distribution of parameters $\psi(x)$ and $\vartheta$. Finally,
introduce the probabilities

$$A(x) := P(S_x \le \psi(x)), \qquad B(x) := P(\eta_x \le \varphi(x)).$$

To show that $P_x(C) \ne 0$, we start from

$$P_x(X_1 \le \varphi(x)) \le A(x) + B(x).$$

From Markov inequality and from the fact that $S_x \ge Z_x$,

$$A(x) \le P(Z_x \le \psi(x)) \le \psi(x)\, E(1/Z_x) \le \psi(x)\, e^{-cx},$$

with a positive $c$ that depends only on the distribution $(p_k)$, see for instance
proposition A.2 in the appendix of Piau (2004). Likewise, Chebychev inequality
for Bernoulli random variables yields

$$B(x) \le e^{\varphi(x)}\, E(e^{-\varepsilon})^{\psi(x)} \le e^{\varphi(x) - c' \psi(x)},$$

with a positive $c'$ that depends only on $\vartheta$ (for instance
$c' := \vartheta/2$). Assume that there exists a nonincreasing sequence
$(\gamma(x))_x$, such that

$$P_x(X_1 \le \varphi(x)) \le \gamma(x) < 1,$$

for every $x$, for instance because

$$\psi(x)\, e^{-cx} + e^{\varphi(x) - c' \psi(x)} \le \gamma(x),$$

and define recursively $(\gamma_k(x))_{k \ge 0}$ by $\gamma_0(x) := \gamma(x)$ and
$\gamma_{k+1}(x) := \gamma_k(\varphi(x))$. Conditioning successively on the values
of $X_n$ and iterating the above yields

$$P_x(C) \ge \prod_{k \ge 0} (1 - \gamma_k(x)).$$

The iteration uses both the fact that $\varphi(x) \ge x$ and the fact that
$(\gamma(x))_x$ is nonincreasing. As a consequence, when $(\gamma_k(x))_k$ is
summable and when every $\gamma_k(x) < 1$, the infinite product is positive.
Finally, since $C \subset F$, $P_x(F) \ge P_x(C)$ and $P_x(F)$ is not zero.

Using standard zero-one laws, this proves at the same time that, conditionally
on $F$, the event that $\liminf X_n/n \ge 1$ is almost sure, since this is an
asymptotic event which contains $C$. We refine this below.
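The product lower bound on $P_x(C)$ can be evaluated numerically once a bounding sequence $\gamma$ and a map $\varphi$ are given. The sketch below is an illustration under assumptions: the function name is hypothetical, and the sample choice $\gamma(x) = 2^{-x}$ is made up for the example rather than derived from the constants $c$ and $c'$ of the text.

```python
def explosion_lower_bound(gamma, phi, x, n_terms):
    """Partial product of (1 - gamma_k(x)) for k < n_terms, with gamma_k(x) = gamma(phi^k(x))."""
    product, state = 1.0, x
    for _ in range(n_terms):
        product *= 1.0 - gamma(state)
        state = phi(state)  # gamma_{k+1}(x) = gamma_k(phi(x))
    return product

# Illustrative choice: gamma(x) = 2**(-x) and phi(x) = x + 1, starting from x = 1.
value = explosion_lower_bound(lambda x: 2.0 ** (-x), lambda x: x + 1, 1, 60)
```

The partial products decrease to the infinite product as `n_terms` grows, so a finite `n_terms` only approximates the bound from above. For a summable $\gamma$ the limit stays positive, which is the point of the argument.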

6 Some applications of the strategy 
6.1 Probability of explosion, not zero 

We first apply section 5 with $\varphi(x) := x + 1$ and $\psi(x) := x^2$. One can
choose $\gamma(x) < 1$ for every $x \ge 1$ and such that
$\gamma(x) \le e^{-c'' x}$ when $x \to \infty$, with a positive $c''$. Since
$\gamma_k(x) = \gamma(k + x)$, the series $(\gamma_k(x))_k$ is summable and every
$\gamma_k(x) < 1$. This proves that $P_x(F)$ is positive for every $x \ge 1$, that
is, the missing part of (7), and completes the proof of theorem A.






6.2 Lower bound in theorem B

Our second application of section 5 is more involved. We choose $\mu < m$ and
an integer sequence $(\varphi(x))_x$ such that $\varphi(x) \ge \mu^x$ for $x$ large
enough. Then we choose $\nu$ in $(\mu, m)$ and an integer sequence $(\psi(x))_x$
such that $\psi(x) \le \nu^x$ for $x$ large enough. Since $\nu > \mu$, the
contributions $(B(x))_x$ are summable.

As regards the contributions $(A(x))_x$, we use standard estimates of the harmonic
moments of $Z_x$. Choose a positive $r$ such that $p_1 m^r < 1$, this is possible
as soon as $m$ is finite and $p_1 \ne 1$. From Ney and Vidyashankar (2003), the
behaviour of $E(1/(Z_x)^r)$ is ruled by the so-called Seneta constants.

To keep things simple, we choose $\rho$ in $(\nu, m)$ and we use the following easy
consequence of the results by Ney and Vidyashankar. There exists a finite
constant $c_0$ such that $E(1/(Z_x)^r) \le c_0\, \rho^{-rx}$ for every $x \ge 1$.
Then, the Markov inequality for $(Z_x)^r$ yields, for $x$ large enough,

$$A(x) \le P(Z_x \le \psi(x)) \le \psi(x)^r\, E(1/(Z_x)^r) \le c_0\, (\nu/\rho)^{rx}.$$

Finally, the sequence $(\gamma(x))_x$ is allowed to decrease geometrically, hence
to be summable.

Since $\varphi(x) \ge x + 1$ for $x$ large enough, $\gamma_k(x) \le \gamma(x + k)$
and the series $(\gamma_k(x))_k$ is summable for every $x$. Thus $P_x(C) \ne 0$.
Introduce

$$G := \{\liminf Y_n \ge \log \mu\}, \qquad Y_n := (\log X_{n+1})/X_n.$$

Since $C \subset G$ and $G$ is asymptotic, $G$ is almost sure on $F$. Finally,
conditionally on $F$,

$$\liminf Y_n \ge \log m \quad \text{almost surely.}$$

6.3 Upper bound in theorem B

As regards the other side of the equality of theorem B, fix $\mu > m$. We use
the simple fact that, if $Y_n \ge \log \mu$, then, conditionally on $X_n = x$, one
has $S^{(n)}_x \ge X_{n+1} \ge \mu^x$. Furthermore,

$$P(S_x \ge \mu^x) \le \mu^{-x}\, E(S_x) \le (m/\mu)^x\, m/(m - 1),$$

which is summable. Conditionally on $F$, $\liminf X_n/n \ge 1$ is almost sure, see
the last lines of section 5. This implies that the events that $Y_n \ge \log \mu$
are realized at most for a finite number of values of $n$, conditionally on $F$ or
not. Thus, $\limsup Y_n \le \log \mu$ almost surely. This concludes the proof of
theorem B.
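The Markov bound used in this last step is easy to check numerically, since $E(S_x) = m + \cdots + m^x$. The sketch below compares the bound $\mu^{-x} E(S_x)$ with the geometric envelope $(m/\mu)^x\, m/(m-1)$. The function names are hypothetical and the parameter values in the usage line are arbitrary.

```python
def mean_cumulative_population(x, m):
    """E(S_x) = m + m**2 + ... + m**x for a Galton-Watson process with mean m."""
    return sum(m ** k for k in range(1, x + 1))

def markov_tail_bound(x, m, mu):
    """Markov bound mu**(-x) * E(S_x) on P(S_x >= mu**x)."""
    return mean_cumulative_population(x, m) / mu ** x

def geometric_envelope(x, m, mu):
    """Envelope (m/mu)**x * m/(m - 1), valid for m > 1 and summable when mu > m."""
    return (m / mu) ** x * m / (m - 1)

# Arbitrary example: m = 1.5 and mu = 2.0.
checks = [markov_tail_bound(x, 1.5, 2.0) <= geometric_envelope(x, 1.5, 2.0)
          for x in range(1, 30)]
```

Every entry of `checks` should be true, because $m + \cdots + m^x = m(m^x - 1)/(m - 1)$ is smaller than $m^{x+1}/(m - 1)$.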